home
***
CD-ROM
|
disk
|
FTP
|
other
***
search
/
Tech Arsenal 1
/
Tech Arsenal (Arsenal Computer).ISO
/
tek-02
/
kruse_11.zip
/
11DOC.DOC
< prev
next >
Wrap
Text File
|
1990-12-01
|
6KB
|
141 lines
11TEXT.EXT 54
Program segments from the textbook
11SOL.EXT 37
Program segments from the solutions manual
Versions of the index-writer program:
INDEXTEXT.PAS 62
This is the basic version, directly from the textbook. It will compile
under VAX/VMS Pascal, but it may not execute, since it sets up an array
of 26 local files, which may exceed the limit set by the operating system.
INDEXWRITE.PAS 71
This is a more flexible version of the index writer including the options of
categorizing text words as page reference and questionable, as well as count,
place in hash table, and index. The program uses fewer temporary files and
hence will run under more operating systems. The program assumes the exist-
ence of a file HASHFILE.DAT in the default directory. File NEWHASHFILE.DAT
stores all the new hash words encountered by the program. This program only
implements Phases I and II of the total index writer program desribed in the
textbook. Should a list of index or page references exceed a maximum record
length of 130, additional references will be placed on a separate line. This
file format is compatible with all other utilities in this directory.
INDEXTEX.PAS 74
This version (the one usually used) modifies the previous program in order to
skip over commands of the TeX typesetting system in the input file, that is
over all sequences beginning with a backslash (\) followed either by a single
nonalphabetic character or any sequence of letters from the alphabet (either
upper- or lowercase). In addition, the program uses the appearance of a TeX
command of the form \*page, where * represents any single letter, to indicate
the end of one page. The text of the book, for example, contains the TeX
command \epage at the end of each page.
ADDHASH.PAS 7
This is the first part of Phase 3 of the total index writer program. Files
HASHFILE.DAT and NEWHASHFILE.DAT are read and merged. The new file is
written to a file as specified by the user.
MERGEWORDS.PAS 18
This is the second part of Phase 3 of the total index writer program. The
master word file and an index file are read in, merged, and written to a file
as specified by the user.
Other utility programs related to the index writer:
MAKEHASH.PAS 9
Reads in a list (text file) of words (one per line) to be placed into a
hash file and writes the hashed file to a user specified file.
MERGEHASH.PAS 5
Combines the counts from two hash-table files, producing a new file
REMDUPL.PAS 4
This utility reads in a master word list and an index of words. All words
in the second list which also appear in the first list are deleted, and the
shortened list is written out to a user-specified file.
REMREF.PAS 3
This utility strips the given index file of all page references and word
counts. It will also remove any lines of continued references that may
appear.
SORTHASH.PAS 16
Sorts a hash-table file into order either alphabetically or by word
frequency, producing new file(s) specified by the user
Data files used by the index writer:
ININDEX.DAT 306
Master word list to be used by the index writer, consisting of 6508 words,
all the distinct words (except those in the hash table) appearing in the
textbook, the instructor's manual, and the solutions manual, together with
a few others.
HASHWORDS.DAT 50
This is a list of 1067 common words to be placed in the hash file by the
utility MAKEHASH.PAS. It is sorted approximately so as to place the more
frequently used words into the table first so that they may be accessed in
less time.
HASHFILE.DAT 44
Be sure that this is in the default directory while running the index writer
program. The file is in hashed format. This file, as supplied, is suitable
for a Vax computer under VMS and DEC Pascal. With other systems, the hash
table should be generated by running the program MAKEHASH.PAS on the file
HASHWORDS.DAT.
INTEXT.DAT 6
This is a short, sample text file (extracted from the preface to the text)
that may be used to test the index-writer programs.
Word lists generated by the index writer:
TEXT.ALH 59
TEXT.FRH 59
TEXT.WRD 365
SOLUTIONS.ALH 59
SOLUTIONS.FRH 59
SOLUTIONS.WRD 204
INSTRUCTOR.ALH 59
INSTRUCTOR.FRH 59
INSTRUCTOR.WRD 150
These files contain the lists of all words (concordances) for the textbook
(total of 201,399 words), for the instructor's manual (46,571 words), and for
the solutions manual (106,197 words), respectively. The files *.ALH are the
words from the hash tables, sorted alphabetically; the files *.FRH are the
words from the hash tables sorted in decreasing frequency of appearance; and
the files *.WRD contain all remaining words (most with a complete list of
page references).
Solutions to programming projects:
INDEXFILES.PAS 70
Modification of the basic program to use fewer temporary files (solution
to Section 11.3, Exercise E6)
INDEXSORT.PAS 61
Modification that sorts lists of words before comparing with the master
word list (solution to Section 11.4, Exercise E3)
INDEXHASH.PAS 64
Modification to use a second hash table to amalgamate references before
comparison with the master word list (solution to Section 11.4, Exercise E4)